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24. The following fees are submitted;. 

BASIC NATIONAL FEE ( 37 CFR 1.492 (a) (1) - (5)) : 

□ Neither international preliminary examination fee (37 CFR 1 .482) nor 
international search fee (37 CFR 1.445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO 



□ 



□ 



International preliminary examination fee (37 CFR 1.482) not paid to 
USPTO but International Search Report prepared by the EPO or JPO 

International preliminary examination fee (37 CFR 1.482) not paid to USPTO 
but international search fee (37 CFR 1.445(a)(2)) paid to USPTO 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 
but all claims did not satisfy provisions of PCT Article 33(1 )-(4) 



□ International preliminary examination fee (37 CFR 1 .482) paid to USPTO 
and all claims satisfied provisions of PCT Article 33(l)-(4) 



$1000.00 
$860.00 
$710.00 
$690.00 
$100.00 



ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS PTO USE ONLY 



$860.00 



Surcharge of $130.00 for furnishing the oath or declaration later than 
months from the earliest claimed priority date (37 CFR 1 .492 (e)). 



□ 20 



□ 30 



$0.00 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



Total claims 
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0 



$18.00 



$0.00 



Independent claims 



- 3 = 



$80.00 



$0.00 



Multiple Dependent Claims (check if applicable). 



SI 



$270.00 



TOTAL OF ABOVE CALCULATIONS = 



$1,130.00 



□ Applicant claims small entity status. (See 37 CFR 1.27). The fees indicated above are 
reduced by 1/2. 



$0.00 



SUBTOTAL = 



$1,130.00 



Processing fee of $130.00 for furnishing the English translation later than □ 20 
months from the earliest claimed priority date (37 CFR 1 .492 (f)). 



□ 30 



$0.00 



TOTAL NATIONAL FEE = 



$1,130.00 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31) (check if applicable). 



$0.00 



TOTAL FEES ENCLOSED = 



$1,130.00 



Amount to be: 
refunded 



charged 



a. 
b. 



□ 
□ 



$1,130.00 



to cover the above fees is enclosed, 
in the amount of 



to cover the above fees. 



A check in the amount of 

Please charge my Deposit Account No. 
A duplicate copy of this sheet is enclosed. 

The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment 
to Deposit Account No. 15-0030 A duplicate copy of this sheet is enclosed. 

Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 
information should not be included on this form. Provide credit card information and authorization on PTO-2038. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: 





. R WILLIAM E , BF A 11M 0 NT 

^ignatureREGISTRATION NUMBER 30,99 5 



Norman F. Obion 



NAME 
24,618 



REGISTRATION NUMBER 



Page 2 of 2 



Docket No.: 212833US0PCT 

IN THE UNITED STATES PATENT & TRADEMARK OFFICE 
IN RE APPLICATION OF: : 
Takami MAEKAWA, et al. : 

SERIAL NO: 09/926„028 : ATTN: BOX SEQUENCE 

FILED: AUGUST 16, 2001 : 

FOR: METHOD FOR ANALYZING EXPRESSION FREQUENCIES OF GENES 

PRELIMINARY AMENDMENT AND STATEMENT 

ASSISTANT COMMISSIONER FOR PATENTS 
WASHINGTON, D.C. 20231 

SIR: 

In response to the Office Communication mailed March 22, 2002, please amend the 
above-identified application as follows: 

IN THE SPECIFICATION 
Please delete the original Sequence Listing. 

Please replace the paragraph beginning on page 14, line 13, with the following text: 
Fig. 1 shows an exemplary structure of plasmid DNA for producing a vector primer 
(MAGE/pUC19). The structure of the MAGE/pUC19 is shown having a multiple cloning 
sequence of GATTCGTGC AGATCTC ACACTGC AGAGATCC AAC AGC ATGGAAGCTT 
(plus strand; SEQ ID NO: 1) and the corresponding minus strand having a sequence (5' to 3') 
of AAGCTTCCATGCTGTTGGATCTCTGCAGTGTGAGATCTGCACGAATTC (SEQ ID 
NO: 1 7). In this, and subsequent, figure(s), the term "plus strand" represents the top 
sequence, which is shown in the figure in 5' to 3' orientation, and the term "minus strand" 
represents the bottom sequence, which is shown in the figure in 3' to 5' orientation. 



Please replace the paragraph beginning on page 14, line 16, with the following text: 
Fig. 2 shows an exemplary structure of a vector primer and a construction process 
therefor. PstI and BstXI cleavage results in the following DNA fragments (all listed 5' to 3') 
shown in Fig. 2B: GAATTCGTGCAGATCTCACACTGCA (SEQ ID NO: 18) 
GAGATCCAACAGC (SEQ ID NO: 19) ATGG A AG CTT (SEQ ID NO: 20) 
GTGTGAGATCTGCACGAATTC (SEQ ID NO: 21) TTGGATCTCTGCA (SEQ ID NO: 
22) AAGCTTCCATGCTG (SEQ ID NO: 23). In Fig. 2C, a partially double-stranded DNA 
having a single-stranded poly(T) sequence is introduced having the following DNA 
sequences (all listed 5' to 3'): CGGAGTTTAAACGGATTGGAGCCAGC (SEQ ID NO: 24) 
GCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 
TTT (SEQ ID NO: 25). In Fig. 2D, ligation results in the following DNA sequences (all 
listed 5' to 3'): CGGAGTTTAAACGGATTGGAGCCAGCATGGAAGCTT (SEQ ID NO: 
26) 

AAGCTTCCATGCTGGCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTTT 
TTTTTTTTTTTTTTTTTT (SEQ ID NO: 27). 

Please replace the paragraph beginning on page 14, line 18, with the following text: 
Fig. 3 schematically shows the steps (a) and (b) of the method of the present 

invention. In Fig. 3E, the sequence (5' to 3') 

-AAAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO: 28) is introduced to 
AAGCTTCCATGCTGGCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTTT 
TTTTTTTT (5' to 3' sequence; SEQ ID NO: 29). The resulting representative synthesized 
DNA in Fig. 3F has the sequence (5' to 3'): 
NNNNNNNNNNNNNNNNNNN^ 

AAAAAAAAAAAAAAAACGGAGTTTAAACGGATTGGAGCCAGCATGGAAGCTT 
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(plus strand; SEQ ID NO: 30) and 

AAGCTTCCATGCTGGCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTN 

NNNNNNNNNNNNNNNNN^ (minus 

strand; SEQ ID NO: 31). 

Please replace the paragraph beginning on page 14, line 20, with the following text: 
Fig. 4 schematically shows the steps (c) and (d) of the method of the present 

invention. In Fig. 41, the represented DNA sequence has the following sequence (5' to 3'): 

GAATTCGTGCAGATCWJMvfNTvn^^ 

AAAAAAACGGAGTTTAAACGGATTGGAGCCAGCATGGAAGCTT (plus strand; SEQ 
ID NO: 32) and 

AAGCTTCCATGCTGGCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTNNNNN 
NNNNNNNNTWNNNNOT (plus strand; SEQ ID 

NO: 33). Following Bsgl and Pmel treatment, Fig. 4J shows the following DNA fragments 
(all listed 5' to 3'): GAATTCGTGCAGATCNNNNNlvlNNNNrNNN (SEQ ID NO: 34) 
Nl^NNNNNNNNNNNNNNNAAAAAAAAAAAAAAACGGAGTTT (SEQ ID NO: 35) 
AAACGGATTGGAGCCAGCATGGAAGCTT (SEQ ID NO: 36) 
NNNNNNNNNNNGATCTGCACGAATTC (SEQ ID NO: 37) 

AAACTCCGTTTTTTTTTTTTTTTNN^^ (SEQ ID NO: 38) 

AAGCTTCCATGCTGGCTCCAATCCGTTT (SEQ ID NO: 39). Following 1) blunting of 
ends and 2) ligation and cyclization Fig. 4K provides the following exemplary sequence (5' to 
3'): 

GAATTCGTGCAGATCNNNNNNNNNNN AAACGGATTGGAGCCAGCATGGAAGCTT 
(plus strand; SEQ ID NO: 40) and 

AAGCTTCCATGCTGGCTCCAATCCGTTTNNNNNNNNNNNGATCTGCACGAATTC 
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(bottom strand; SEQ ID NO: 41). In Fig. 4L, a exemplary PCR amplification product is 
provided with the following sequence (5' to 3'): GATCNNNNNNNNNNNAAACGGATC 
(plus strand; SEQ ID NO: 42) and GATCCGTTTNNNNNNNNNNNGATC (minus strand; 
SEQ ID NO: 43). 

Please replace the paragraph beginning on page 14, line 22, with the following text: 
Fig. 5 schematically shows the step (e) of the method of the present invention and a 
step of inserting an amplification product obtained in the step (e) into a cloning vector for 
sequencing. The small DNA fragment (listed 5* to 3') is GATCNNNNNNNNNNNAAACG 
(plus strand; SEQ ID NO: 44) and GATCCGTTTNNNNNNNNNNN (minus strand; SEQ ID 
NO: 45). 

At page 38 (Abstract), after the last line, beginning on a new page, please insert the 
attached substitute Sequence Listing. 

REMARKS 

Claims 1-8 are pending in the present application. 

Applicants have now submitted a substitute Sequence Listing and a corresponding 
computer-readable Sequence Listing. The sequence information recorded in the 
corresponding computer-readable Sequence Listing is identical to the paper copy of the 
substitute Sequence Listing. Support for all of the sequences listed in the substitute 
Sequence Listing is found in the present application as originally filed. No new matter is 
believed to have been introduced by the submission of the substitute Sequence Listing and 
the corresponding computer-readable Sequence Listing. 

Applicants submit that a surcharge for filing the application in the absence of an Oath 
or Declaration is not required. Applicants submit herewith a copy of the Declaration as 
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originally filed, along with a copy of the date-stamped filing receipt evidencing timely filing 
thereof. In the event that the Office determines that a surcharge is required, the 
Commissioner is hereby authorized to charge any additional fees to Deposit Account No. 
15-0030 . 

Applicants submit that the present application is ready for examination on the merits. 

Early notice to this effect is earnestly solicited. 

Respectfully submitted, 

OBLON, SPIVAK, McCLELLAND, 
MAIER & NEUSTADT, P.C. 




Norman F. Obion 
Attorney of Record 
Registration No. 24,618 



Vincent K. Shier 
Registration No. 50,552 
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MARKED-UP COPY 

IN THE SPECIFICATION 
Please delete the original Sequence Listing. 

Please replace the paragraph beginning on page 14, line 13, with the following text: 
Fig. 1 shows an exemplary structure of plasmid DNA for producing a vector primer 
(MAGE/pUC19). The structure of the MAGE/pUC19 is shown having a multiple cloning 
sequence of GATTCGTGC AG ATCTC AC ACTGC AGAGATCC A AC AGC ATGGA AGCTT 
(plus strand: SEP ID NO: 1) and the corresponding minus strand having a sequence (5' to 3') 
of AAGCTTCCATGCTGTTGGATCTCTGCAGTGTGAGATCTGCACGAATTC fSEO ID 
NO: 17). In this, and subsequent. figurefs). the term "plus strand" represents the top 
sequence, which is shown in the Figure in 5 ? to 3' orientation, and the term "minus strand" 
represents the bottom sequence, which is shown in the figure in 3 T to 5' orientation. 

Please replace the paragraph beginning on page 14, line 16, with the following text: 
Fig. 2 shows an exemplary structure of a vector primer and a construction process 
therefor. PstI and BstXI cleavage results in the following DNA fragments (all listed 5' to 3') 
shown in Fig. 2B: GAATTCGTGCAGATCTCACACTGCA (SEP ID NO: 18) 
GAGATCCAACAGC TSEO ID NO: 19) ATGGAAGCTT fSEO ID NO: 20) 
GTGTGAGATCTGCACGAATTC (SEP ID NO: 21) TTGGATCTCTGCA TSEO ID NO: 
22) AAGCTTCCATGCTG (SEP ID NO: 23). In Fig. 2C. a partially double-stranded DNA 
having a single-stranded poly(T) sequence is introduced having the following DNA 
sequences (all listed 5' to 3'): CGGAGTTTAAACGGATTGGAGCCAGC ( SEP ID NO: 24) 
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GCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 
TTT (SEP ID NO: 25V In Fig. 2D. ligation results in the following DNA sequences (all 
listed 5' to 3'Y. CGGAGTTTAAACGGATTGGAGCCAGCATGGAAGCTT (SEP ID NO: 
26) 

AAGCTTCCATGCTGGCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTTT 
TTTTTTTTTTTTTTTTTT (SEP ID NP: 27). 

Please replace the paragraph beginning on page 14, line 18, with the following text: 
Fig. 3 schematically shows the steps (a) and (b) of the method of the present 

invention. In Fig. 3E. the sequence (5' to 3') 

- A A A A A A A A AAA AA A A A A A A A A A A A (SEP ID NP: 28^ is introduced to 
AAGCTTCCATGCTGGCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTTT 
TTTTTTTT (5' to 3' sequence: SEP ID NP: 29). The resulting representative synthesized 
DNA in Fig. 3F has the sequence (5' to 3'): 
NNNNNNNNNNNNNNNNNNNN^ 

AAAAAAAAAAAAAAAACGGAGTTTAAACGGATTGGAGCCAGCATGGAAGCTT 
(plus strand: SEP ID NP: 30) and 

AAGCTTCCATGCTGGCTCCAATCCGTTTAAACTCCGTTTTTTTTTTTTTTTTTTTTN 

NNNNNNNNNNNNNNNNNNNNNN^ (minus 

strand: SEP ID NP: 31). 

Please replace the paragraph beginning on page 14, line 20, with the following text: 
Fig. 4 schematically shows the steps (c) and (d) of the method of the present 

invention. In Fig. 41. the represented DNA sequence has the following sequence (5' to 3'): 

GAATTCGTGCAGATCNNNNNNTsTNn^ 

AAAAAAACGGAGTTTAAACGGATTGGAGCCAGCATGGAAGCTT (plus strand: SEP 
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ID NO: 32) and 



NTVINNNNNNNNNNNNNNNNNN^ (plus strand: SEP ID 

NO: 33). Following Bsgl and Pmel treatment. Fig. 4J shows the following DNA fragments 
fall listed 5' to 3'): GAATTCGTGCAGATCNNNNNNNNNNNNN (SEP ID NO: 34) 
NNNNNNNNNNNNTvmNTWAAAAAAAAAAAAAA (SEP ID NO: 35) 

AAACGGATTGGAGCCAGCATGGAAGCTT (SEP ID NP: 36) 
NTsnWNNNNNNNGATCTGCACGAATTC (SEP ID NP: 37) 

AAACTCCGTTTTTTTTTTTTTTTNNKNNNNNNNNNNNN^ (SEP ID NP: 38) 
AAGCTTCCATGCTGGCTCCAATCCGTTT (SEP IDNP: 39V Following 1) blunting of 
ends and 2) ligation and cyclization Fig. 4K provides the following exemplary sequence (5' to 
30i 

GAATTCGTGCAGATCNNNNNNNNNNNAAACGGATTGGAGCCAGCATGGAAGCTT 
(plus strand: SEP ID NP: 40) and 
AAGCTTCCATGCTGGCTCCAATCCGTTTNNT^^ 

(bottom strand: SEP ID NP: 41). In Fig. 4L. a exemplary PCR amplification product is 
provided with the following sequence (5' to 3'): GATCNNNN1STNNNNNNAAACGGATC 
(plus strand: SEP ID NP: 42) and GATCCGTTTNNNNNNNNNNNGATC (minus strand: 
SEP ID NP: 431 

Please replace the paragraph beginning on page 14, line 22, with the following text: 
Fig. 5 schematically shows the step (e) of the method of the present invention and a 
step of inserting an amplification product obtained in the step (e) into a cloning vector for 
sequencing. The small DNA fragment (listed 5' to 3') is GATCNNNNNNNNNNNAAACG 
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rplus strand: SEP ID NO: 44^> and GATCCOTTTNNNNNNNNNNN fminus strand: SEP ID 
NP: 45V 

At page 38 (Abstract), after the last line, beginning on a new page, please insert the 
attached substitute Sequence Listing. 
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Method for analyzing expression frequencies of 

genes 



Technical Field 

The present invention relates to a method 
for analyzing expression frequencies of genes. 
More precisely, the present invention relates to 
a method that enables analysis of types and 
amounts of mRNA expressed from all of genes 
coding for proteins in a cell even with a small 
amount of biosample in order to elucidate dynamic 
change of gene expression. 



Background Art 

The total number of genes on the genome 
coding for proteins is expected to be about 
100,000 for human. As for yeast, of which total 
genomic structure have already been elucidated, 
the number of genes coding for proteins are 
estimated to be about 5000 . 

In recent years, public gene databanks have 
been established mainly in Europe, the United 
States and Japan. An enormous amount of gene 
information has been registered at such databanks 
from all over the world, and further information 
is newly coming together into the databanks every 
day. The human genome project is currently being 
pursued in a worldwide scale aiming at 
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elucidating the total genes of human genome by 
the year of 2005, and the gene information 
obtained in that project is also being registered 
at the databanks. By inquiring to such databanks 
about a certain gene sequence, one can know if 
any gene having the same sequence as, or 
analogous sequence to the gene sequence has 
already been registered or not and, if registered, 
information concerning the sequence including 
designation and function of the gene, related 
references and so forth. Such a search is called 
homology search. There are several kinds of 
software for performing homology search- However, 
when a large number of samples must be searched 
for, BLAST is usually used, of which searching 
time is short. 

Usually, all of genes contained in a cell 
are not necessarily transcribed into mRNA so as 
to produce proteins from mRNA, and it is 
estimated for human that about 15000 genes are 
expressed in one cell. Thus, in a cell, many 
kinds of genomic genes are expressed, and a 
corresponding number of types of messenger RNA 
(henceforth referred to as "mRNA") are produced. 
However, types and amounts of expressed genes 
(also referred to as "genetic expression 
frequency information" hereinafter) may vary 
depending on the types and conditions of cells . 
For example, when a blood stem cell 



O ^ r?! & O 2? S - O S : P 6 O J t 



3 



differentiates into a lymphocyte precursor cell, 
pre-B cell, B cell, and then activated B cell, 
each cell shows entirely different gene 
expression, although there are also genes that 
are commonly expressed in them. 

Measurement of such genetic expression 
frequency information as described above is 
called genetic expression profile analysis. 
Substances responsible for cellular life 



important to analyze types and amounts of 
proteins translated from mRNAs as the genetic 
expression analysis. However, it is technically 
difficult under a current situation to obtain 
profiles for the total proteins. On the other 
hand, measurement of total types of mRNA has 
already become possible. 

The method reported for the first time as 
the genetic expression profile analysis method is 
the Body Map method (Gene, 174, 151-158 (1996)). 
The outline of the Body Map method is as follows. 
A poly(T) sequence on a vector is annealed to a 
poly(A) tail at the 3' end of each mRNA, and a 
cDNA is synthesized by using the vector poly(T) 
sequence as a primer. Further, the cDNA is 
digested with a restriction enzyme Mbol * Since 
one of MJbol site exists in every 300 base pairs 
of cDNA in average, the cDNA on the vector is 
digested into a length of 300 base pairs in 
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average. At this time, a cDNA fragment nearest 
the poly(A) tail remains ligated to the vector. 
The vector having this cDNA fragment is each 
cyclized and introduced into Escherichia coli to 
prepare a cDNA library. About 1000 clones are 
arbitrarily selected from the library, and the 
nucleotide sequence for 300 base pairs in average 
of each clone is determined- The clones were 
divided into groups of clones having the same 
sequence, and type and occurring frequency of 
each sequence are calculated to obtain genetic 
expression profile. Homology search (BLAST 
search) is performed in a databank for each cDNA 
sequence, and clones containing genes having the 
same sequence as known genes are given the names 
of the genes. When the sequence is not registered 
at the databank, it is considered that no gene 
corresponding to the sequence exists. 

In order to perform homology search by the 
BLAST search, information for at least 11 base 
pairs is required. The types of sequences 
consisting of 10 nucleotides are about 1,000,000, 
and this number is far beyond the number of gene 
types of which existence is expected in human, 
i.e., 100,000. That is, if there is information 
for 11 base pairs, a gene having the sequence can 
be identified and thus the genetic expression 
profile analysis is possible. Therefore, if, 
aiming at increasing the efficiency of the 
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genetic expression profile analysis by Body Map 
which requires much sequencing, cDNA fragments of 
about 300 base pairs used in Body Map are further 
made into short fragments of 11 base pairs or 
more (called "tag"), many of these fragments are 
ligated and inserted into a vector to prepare a 
library of ligated tags, about 1000 clones are 
arbitrarily selected as in Body Map, and DNA 
sequences of the ligated tags are determined, it 
is expected that more genetic expression 
information can be obtained with the same labor 
compared with Body Map. Each tag represents a 
gene sequence, and occurring frequency of the tag 
indicates expression frequency of the gene. Since 
the length of DNA sequence that can be determined 
by once of sequencing is usually about 600 base 
pairs, DNA sequences of about 50 tags at most can 
be determined by once of sequencing. That is, it 
becomes possible to perform the genetic 
expression profile analysis with efficiency about 
50 times higher than that of the Body Map method. 

As a method for genetic expression profile 
analysis based on the aforementioned concept, 
there is the method of serial analysis of gene 
expression (SAGE, U.S. Patent Nos . 5,695,937 and 
5,866,330, European Patent Publication No. 
0761822 A). In SAGE, cDNA is produced by using a 
poly(T) of which 3' end is bonded with biotin as 
a primer, the cDNA is digested with a restriction 



enzyme such as Mbol (called an "anchoring 
enzyme") as in Body Map, cDNA fragments 
containing the 3 ' end to which biotin is bonded 
are adsorbed on avidin beads, the beads are 
divided into two of portions, and two kinds of 
linkers (A or B) are each ligated to the cDNA 
fragments (about 13 bp) adsorbed on either of the 
two portions of the beads . Each linker contains a 
site for a Class II restriction enzyme such as 
BsmFI (called a "tagging enzyme"). Each cDNA 
fragment is excised from the beads with the 
tagging enzyme, the excised end is blunt-ended, 
and the tags ligated to the linker A and the 
linker B are connected. The product of the 
connection is called a "ditag". The ditag is 
amplified by PCR using primers that recognize the 
linker A and the linker B. A large number of 
amplified ditags are ligated, inserted into a 
vector, and sequenced. About 50 tag sequences can 
be obtained by once of sequencing. By calculation 
based on this tag sequence information, genetic 
expression frequencies are provided. 

Further, as other methods for analyzing 
expression frequencies of genes, there are the 
gene chip method and the gene microarray method. 
In both of the methods, there are used gene 
fragments adhered in array to a suitable plate 
(usually slide glass) at an extremely high 
density (about 10 fragments /mm 2 or more). The 
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gene fragments on this chip are hybridized with 
fluorescence-labeled mRNAs to determine types and 
amounts of mRNAs - 

As described above, several methods have 
been developed for analyzing expression 
frequencies of genes, and fair results have been 
obtained. Currently, the SAGE method is the most 
effective means for measuring expression 
frequencies of the total genes of all eukaryotic 
organisms. However, when this method was actually 
practiced, it encountered many problems, and the 
SAGE method could not be reproduced in most of 
research facilities. That is, the techniques 
required for the SAGE method are complicated and 
they can be performed only by specially trained 
persons. Further, about 1 ^g of mRNA is required 
for the measurement, and therefore it is 
substantially impossible to perform the 
measurement with a sample that can be obtained in 
a small amount, for example, a clinical biopsy 
material, or to measure difference of genetic 
expression in micro tissue portions. Furthermore, 
the method theoretically causes considerable 
measurement errors . 

In the SAGE method, it is extremely 
important to accurately determine a sequence of 
the tag. This is because the tag is short (about 
13 bp) and therefore, if even only one of 
nucleotides is erroneously determined, it may be 
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determined to be a different sequence even though 
it is the same sequence, or a different sequence 
may be determined to be the same sequence. 
However, such an erroneous determination is 
likely to occur in the SAGE method. This is 
because, in the SAGE method, two tags are 
connected to form a ditag and the border between 
the tags becomes indefinite. The tag is a short 
gene fragment excised by a restriction enzyme 
such as BsmFI and Fokl . However, cleavage sites 
of these enzymes are not always stable, and the 
lengths of the excised tags are diversified. Thus, 
if the tags are connected to form a ditag in the 
state that tags of different lengths are 
intermingled, it becomes indefinite from which 
tag nucleotides at the ligation site of the tags 
have been derived. As a result, it becomes 
impossible to obtain a correct sequence of the 
tag. Thus, the SAGE method suffers from a 
theoretically inevitable drawback. Furthermore, 
the SAGE method involves an operation of 
collecting DNA using avidin and biotin beads. 
However, it is in fact extremely difficult to 
collect DNA by using avidin and biotin beads 
without inviting contamination, and it is 
extremely difficult to obtain correct data by 
operating according to the protocol as it is. 
Moreover, the SAGE method requires a large amount 
of mRNA in order to obtain data. Therefore, in a 
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case of a sample of which amount is limited, for 
example, clinical samples, a sufficient amount of 
mRNA cannot be obtained and thus the SAGE method 
cannot be performed. 

Furthermore, in the gene chip method and the 
gene microarray method, the measurement can be 
performed only for a gene of which structure is 
known, unlike the Body Map method or the SAGE 
method. Therefore, under the current situation, 
expression frequencies of total genes of all 
organisms cannot be measured. 

Currently, the SAGE method is the most 
effective means for measuring expression 
frequencies of the total genes of all eukaryotic 
organisms. However, when this method was actually 
practiced, it encountered many problems, and the 
SAGE method could not be reproduced in most of 
research facilities- The drawbacks of the SAGE 
method are as follows: (1) the techniques 
required for the method are complicated and they 
can be performed only by specially trained 
persons; (2) about 1 jjg of mRNA is required for 
the measurement, and therefore it is 
substantially impossible to perform the 
measurement with a sample that can be obtained in 
a small amount, for example, a clinical biopsy 
material, and it is similarly impossible to 
measure difference of genetic expression in micro 
tissue portions; and (3) the method theoretically 
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causes considerable measurement errors because a 
ditag is measured. 

Disclosure of the Invention 

The present invention was accomplished in 
view of the aforementioned current situation, and 
an object thereof is to provide a method that can 
be easily performed by ordinary researchers and 
enables accurate genetic expression frequency 
analysis with a micro amount of a specimen* 

The inventors of the present invention 
assiduously studied in order to achieve the 
aforementioned object. As a result, they found 
that expression frequencies of genes could be 
efficiently analyzed with high precision by 
synthesizing each cDNA from each mRNA using a 
vector primer having a poly(T) sequence, 
converting each cDNA sequence to a tag on the 
vector, forming a concatemer by ligating the 
obtained tags via a sequence that enables 
recognition of ends of the tags, and analyzing a 
nucleotide sequence of the concatemer, and thus 
accomplished the method of the present invention 
designated as "MAGE (Micro-analysis of Gene 
Express ion ) " . 

That is, the present invention provides the 
f ollowings . 

(1) A method for analyzing expression 
frequencies of genes, which comprises the 
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following steps: 

(a) a step of forming a vector primer to which 
each cDNA is ligated, by annealing the vector 
primer with each mRNA derived from a cell of 
which expression frequencies of genes is to be 
analyzed, and synthesizing the cDNA , said vector 
primer comprising a linear plasmid vector having 
a single-stranded poly(T) sequence at one 3' end, 
a recognition sequence for a first restriction 
enzyme in an inner position from the poly(T) 
sequence, a recognition sequence for a second 
restriction enzyme near the other end, and a 
recognition sequence for a type IIS restriction 
enzyme in an inner position from the recognition 
sequence for the second restriction enzyme, 

(b) a step of digesting the vector primer to 
which the cDNA is ligated, with the second 
restriction enzyme and a third restriction enzyme 
that does not digest the vector primer and forms 
a digested end of the same shape as a digested 
end obtained with the second restriction enzyme, 
to excise an upstream region of the cDNA, and 
cyclizing the vector primer, 

(c) a step of digesting the cyclized vector 
primer with the first restriction enzyme and the 
type IIS restriction enzyme to excise a 
downstream region of the cDNA so that a tag 
consisting of a part of the cDNA is left, and 
cyclizing the vector primer again, 
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(d) a step of performing polymerase chain 
reaction (PGR) by using the vector primer as a 
template and oligonucleotides having nucleotide 
sequences corresponding to respective flanking 
regions of the both sides of the tag contained in 
the vector primer as primers to amplify the tag, 

(e) a step of ligating the amplification products 
to form a concatemer of the tags, and 

(f) a step of determining the nucleotide sequence 
of the concatemer and investigating types and 
frequencies of tags occurring in the nucleotide 
sequence . 

(2) The method according to (1)/ wherein the 
ligation reaction in the step (e) is performed in 
the presence of an adaptor having one end of the 
same shape as an end of the tag to arrange the 
adaptor at each end of the concatemer, and the 
concatemer is amplified by performing PCR using 
an oligonucleotide having a sequence 

corresponding to the sequence of the adaptor as a 
primer • 

(3) The method according to (1) or (2), 
wherein, after the step (e), the concatemer is 
cloned in a cloning vector for nucleotide 
sequencing, and then the nucleotide sequence of 
the concatemer is determined. 

(4) The method according to any one of (1) 
to (3), wherein the recognition sequence for the 
third restriction enzyme consists of four 
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nucleotides . 

(5) The method according to any one of (1) 
to (4), wherein the vector primer has a 
recognition sequence for a fourth restriction 
enzyme of which digestion point is in the same 
position as or an inner position from the 
digestion point of the recognition sequence for 
the second restriction enzyme, which is not 
excised from the vector primer by the digestion 
with the type IIS restriction enzyme ; 

the primer for the downstream side of the 
tag among the primers used in the step (d) has a 
recognition sequence for a fifth restriction 
enzyme that forms an end of the same shape as the 
end digested with the fourth restriction enzyme; 
and 

the concatemer is formed after the amplified 
primers are digested with the fourth restriction 
enzyme and the fifth restriction enzyme. 

(6) The method according to (5), wherein the 
vector primer has a nucleotide sequence different 
from the recognition sequence for the fifth 
restriction enzyme by one nucleotide in an inner 
position from the recognition sequence for the 
first restriction enzyme, and the nucleotide 
sequence different by one nucleotide is converted 
to the recognition sequence for the fifth 
restriction enzyme by PCR using the primer for 
the downstream side of the tag. 
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(7) The method according to (6), wherein the 
third, fourth and fifth restriction enzymes are 
identical to one another. 

(8) The method according to any one of (1) 
to (7), wherein the vector primer is formed by 
ligating a linear plasmid obtained by digesting a 
plasmid having a multicloning site at two sites 
in the multicloning site, and a partially double- 
stranded DNA having an end of the same shape as 
one end of the linear plasmid and a single- 
stranded poly(T) sequence. 

Brief Explanation of the Drawings 

Fig. 1 shows an exemplary structure of 
plasmid DNA for producing a vector primer 
(MAGE/pUC19 ) . 

Fig. 2 shows an exemplary structure of a 
vector primer and a construction process therefor 

Fig. 3 schematically shows the steps (a) and 
(b) of the method of the present invention. 

Fig. 4 schematically shows the steps (c) and 
(d) of the method of the present invention. 

Fig. 5 schematically shows the step (e) of 
the method of the present invention and a step of 
inserting an amplification product obtained in 
the step (e) into a cloning vector for sequencing 

Best Mode for Carrying out the Invention 
The method of the pres ent invent ion 
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comprises the aforementioned steps (a) to (f). 
Hereafter, each step will be explained. 
<1> Step (a) 

In the first step, a vector primer 
comprising a linear plasmid vector having a 
single-stranded poly(T) sequence at one 3' end, a 
recognition sequence for a first restriction 
enzyme in an inner position from the poly (T) 
sequence, a recognition sequence for a second 
restriction enzyme near the other end, and a 
recognition sequence for a type IIS restriction 
enzyme in an inner position from the recognition 
sequence for the second restriction enzyme is 
used (also referred to as a "the vector primer 
for reverse transcription" hereinafter). The type 
IIS restriction enzyme means a restriction enzyme 
that cleaves a specific position remote from a 
sequence recognized by the restriction enzyme. 

An exemplary structure of the vector primer 
is shown in Fig. 2, D. This vector primer can be 
prepared by, for example, ligating a linear 
plasmid obtained by digesting a plasmid having a 
multicloning site at two sites in the 
multicloning site, and a partially double- 
stranded DNA having an end of the same shape as 
one end of the linear plasmid and a single- 
stranded poly(T) sequence. The structure of 
MAGE/pUC19 is shown in Fig. 1 as an example of 
the plasmid having a multicloning site. This 



ini,ra a""ii ™u o p.?; o» tn\ o ii sr 



16 

primer comprises pUC 19, which is a known cloning 
vector, as a basic structure, and a sequence 
containing various restriction enzyme recognition 
sites (referred to as "Z fragment", sequence is 
shown in SEQ ID NO: 1) is inserted between the 
EcoRI and Hindlll restriction enzyme sites of the 
multicloning site thereof. 

In the following specific example, 
explanation will be made for an example that 
utilizes MAGE/pUC19 prepared by using an 
Escherichia coli host having Dam methylation 
system, for example, JM109 strain etc. However, 
it is not essential for the present invention. 
Further, the vector used as the basic structure 
is not limited to pUC19, and various other 
vectors such as pUC18, pBR322, pHSG299, pHSG298, 
pHSG399, pHSG398, RSF1010, pMWll9, pMWll8, pMW2 1 9 
and pMW218 can be used. In addition, the host of 
the vector is not particularly limited, so long 
as it is one for which ordinary genetic 
recombination techniques such as transformation 
and recovery of vectors from the host can be used 
However, Escherichia coli is usually used. 

An example utilizing MAGE/pUC19 as the 
plasmid having a multicloning site will be 
explained hereafter. However, the present 
invention is not limited to this example. 

First, MAGE/pUC19 vector (Fig. 2, A) is 
digested with restriction enzymes BstXI and PstI, 
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and the excised small fragment is removed (Fig. 2, 
B). Then, a partially double- stranded DNA having 
a single-stranded poly(T) sequence (poly(T) 
adaptor shown in Fig. 2, C) is ligated to the 
BstXI -digested end. This provides a vector primer 
(or a vector primer for reverse transcription) in 
which the single-stranded poly(T) protruding from 
one end of the MAGE/pUC19 vector (Fig. 2, D). In 
this vector primer, the first restriction enzyme 
is Pmel , the second restriction enzyme is Bglll , 
and the type IIS restriction enzyme is Bsgl . 

The recognition sequence for the second 
restriction enzyme Bglll contains an overlapping 
Mbol recognition sequence, and therefore if the 
Bglll end is ligated to the Mbol end, it can be 
digested with Mbol . Further, in this example, the 
vector primer is designed to contain a nucleotide 
sequence {AMbol of Fig. 3, E) that is different 
from the fourth restriction enzyme (Mjbol) 
recognition sequence by one nucleotide in a 
further inner position from the first restriction 
enzyme (Pmel) recognition sequence. This AMbol 
and the first restriction enzyme recognition 
sequence of the vector primer are derived from 
the poly(T) adaptor. 

The first restriction enzyme and the second 
restriction enzyme are not particularly limited, 
so long as they digest the vector primer at one 
position. Further, the type IIS restriction 
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enzyme and its recognition sequence-existing 
position are not particularly restricted, so long 
as it does not excise the fourth restriction 
enzyme site from the vector primer, and it 
digests the upstream region of the cDNA so that a 
part of the cDNA is left on the vector primer. 
Specifically, it may be, for example, BsmFI etc., 
in addition to the aforementioned Bsgl. 

Further, the length of the poly(T) sequence 
may be a length that enables annealing with the 
poly(A) sequence of mRNA , and it is usually about 
10-50 nucleotides . 

Such a vector primer as mentioned above and 
mRNA derived from a cell for which expression 
frequencies of genes are to be analyzed are 
annealed. When a reverse transcription reaction 
is performed in that state, the poly(T) serves as 
a primer and cDNA synthesis is initiated (Fig. 3, 
E). Then, the second strand can be synthesized by 
using the synthesized first strand of cDNA as a 
template to synthesize a double-stranded cDNA 
(Fig. 3, F). While a large number of vector cDNA- 
ligated primer molecules ( cDNA-MAGE / pUC 1 9 ) can be 
produced from a large number of mRNA molecules, 
Fig. 3, F represents one of typical example 
thereof . 

The mRNA is extracted from a cell for which 
expression frequencies of genes are to be 
analyzed. Any cell can be used as the cell for 
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which expression frequencies are to be analyzed 
without any particular limitation, so long as it 
is a cell in which 3 ' end of mRNA has the poly (A) 
structure, and it may be a tissue cell of an 
animal or a plant, a cell of a microorganism such 
as yeast- Further, mRNA of a prokaryote does not 
have the poly (A) structure at the 3' end and 
therefore it cannot be annealed as it is to the 
poly(T) of the vector primer. However, by 
enzymat ically adding a poly(A) structure to the 
mRNA, the method of the present invention can be 
performed in a manner similar to that for mRNA of 
an eukaryotic organism- 
Operations such as preparation of mRNA, cDNA 
synthesis, synthesis of oligonucleotide, reaction 
with a restriction enzyme, ligation reaction, 
polymerase chain reaction (PCR), and 
transformation can be performed in the same 
manner as methods for preparing mRNA used for 
usual cDNA cloning (see, for example, Sambrook J-, 
Fritsch, E.F., and Maniatis, T . , "Molecular 
Cloning A Laboratory Manual, Second Edition", 
Cold Spring Harbor Laboratory Press (1989) etc.)* 

<2> Step (b) 

Then, the vector primer to which cDNA is 
ligated, obtained as described above, is digested 
with the second restriction enzyme and a third 
restriction enzyme that does not digest the 
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vector primer and forms a digested end of the 
same shape as a digested end obtained with the 
second restriction enzyme, to excise an upstream 
region of the cDNA, and the vector primer is 
cyclized. The cyclized vector primer can be 
amplified by introducing it into a suitable host, 
culturing the obtained t rans f ormant , and 
collecting a plasmid. As will be similarly 
applied to cyclized vector primers used in the 
subsequent steps, when the restriction enzyme 
site for cleaving the obtained vector primer 
cannot be digested due to Dam methylation, a host 
that does not have the modification system is 
used . 

As the third restriction enzyme, an enzyme 
recognizing four nucleotides is preferred. If an 
enzyme recognizing six nucleotides is used, such 
a restriction enzyme site may not exist in the 
cDNA sequence. Further, if the sequence of cDNA 
left on the vector primer is long, the tag 
obtained by the subsequent procedures will become 
far from the poly(A) sequence of mRNA ♦ In such a 
case, since the expression information of genes 
in a database (EST: expressed sequence tag) is 
usually for parts of the 3' end side of mRNA, the 
target sequence may not be retrieved even if tag 
sequences are searched. Examples of restriction 
enzyme suitable for the present invention, which 
recognize four nucleotides, include Mbol , Tail 
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and so forth. 

Hereafter, an example utilizing MJbol as the 
third restriction enzyme will be explained. First, 
the cDNA-ligated vector primer ( c DNA-MAGE / pUC 1 9 ) 
is digested with restriction enzymes Sglll and 
Mbol (Fig. 3, G). Since MAGE/pUC19 have been Dam- 
methylated at this time, it is not digested with 
Mbol, and only the MJbol site newly synthesized in 
the cDNA by the reverse transcription is digested. 
Fig. 3, G shows a temporary example where the 
cDNA contains three of Mjbol sites. As for the 
cDNA, a downstream region from the poly(A) tail 
to the first Mbol site toward the upstream region 
remains ligated to the MAGE/pUC19 vector primer, 
and the other portion, i.e., the upstream region 
of the cDNA is excised and removed by the Mbol 
digestion. As for MAGE/pUC19, it is digested with 
Bglll at only one position. The end portions 
digested with Mbol and Bglll are in the same 
shape, and they can be ligated by a ligation 
reaction. Therefore, the cDNA-MAGE / pUC 1 9 is 
cyclized by a self -ligation reaction via these 
ends ( Fig . 3 , H ) . 

<3> Step (c) 

The vector primer cyclized in the step (b) 
is digested with the first restriction enzyme and 
the type IIS restriction enzyme to excise the 
downstream region of the cDNA, so that a tag 
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consisting of a part of the cDNA is left, and the 
vector primer is cyclized again. 

Specifically, the cyclized cDNA-MAGE / pUC 1 9 
is digested with restriction enzymes Bsgl and 
Pmel (Fig. 4, I). After digestion with these 
restriction enzymes, only about 13 nucleotides 
farthest from the poly (A) tail of the 3' end 
portion of the cDNA remain on the vector primer. 
This sequence of cDNA consisting of about 13 
nucleotides is called tag (shown in Fig. 4, J as 
"tag" ) . 

Since the strand of 5' end protrudes by 2 bp 
at the end digested with Bsgl, it is blunt-ended 
by T4 DNA polymerase treatment, for example. 
Further, other end digested with Pmel is a blunt 
end. Therefore, these ends can be ligated (Fig. 4 
J). Thus, if the vector primer is cyclized by a 
self -ligation reaction, a vector primer having a 
structure comprising MAGE/pUC19 to which a short 
tag is ligated, in which the part of cDNA other 
than the tag is excised, is obtained (Fig. 4, J 
and K) . 

<4> Step (d) 

Then, PCR is performed by using the vector 
primer cyclized in the step (c) as a template, 
and oligonucleotides having nucleotide sequences 
corresponding to respective flanking regions of 
the both sides of the tag in the vector primer as 
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primers to amplify the tag- 
Specifically, PCR is performed by using the 
vector primer containing the tag as a template, 
and primers corresponding to the sequences of the 
vector segments of the both sides of the tag (for 
example, those of SEQ ID NOS : 2 and 3) (Fig. 4, 
K). As a result, a DNA fragment containing the 
tag at the center is amplified (in this case, 110 
bp including the sequences of primers and the 
tag ) ( Fig . 4 , L ) * 

<5> Step (e) 

The aforementioned PCR amplification 
products are ligated to form a concatemer of the 
tags. At this time, if a recognition sequence for 
a fifth restriction enzyme that produces an end 
of the same shape as the end obtained by 
digestion with the fourth restriction enzyme is 
introduced into the primer for the downstream 
region of the tag among the primers used in the 
step of (d), and the amplified primer is digested 
with the fourth restriction enzyme and the fifth 
restriction enzyme, the concatemer can be formed 
efficiently. The type of the fifth restriction 
enzyme is not particularly limited. However, if 
the same restriction enzyme as the fourth 
restriction enzyme is used, the procedure will 
become simple. Further, the same restriction 
enzyme as the third restriction enzyme may be 
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used as the fourth restriction enzyme. 
Furthermore, the third, fourth and fifth 
restriction enzymes may be the same enzyme. 
However, if the same enzyme is used as the fifth 
restriction enzyme and fourth restriction enzyme, 
the total sequence of the cDNA will be excised 
from the vector primer in the step (b). Therefore, 
in order to prevent this, it is necessary to 
introduce a nucleotide sequence different from 
the recognition sequence for the fifth 
restriction enzyme by one nucleotide at a 
position in a further inner position from the 
first restriction enzyme recognition sequence of 
the vector primer, and perform PGR by using a 
primer having a recognition sequence for the 
fifth restriction enzyme for the downstream side 
of the tag so that the nucleotide sequence 
different by one nucleotide is replaced with the 
recognition sequence for the fifth restriction 
enzyme. Further, when the same enzyme is used as 
the third restriction enzyme and fourth 
restriction enzyme, the recognition site of the 
vector primer for the fourth restriction enzyme 
must be made to be not digested by methylation. 
In this case, before digestion with the fourth 
restriction enzyme, the vector primer cyclized in 
the step (c) is introduced into the host not 
having the modification system for demodif icat ion . 
If the ligation reaction of the tags is 
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performed in the presence of an adaptor having 
one end of the same shape as an end of the tag to 
arrange the adaptor at each end of the concatemer, 
and PCR is performed by using an oligonucleotide 
having a sequence corresponding to the sequence 
of the adaptor as a primer, the concatemer can be 
amplified. If the adaptor is used in a smaller 
amount compared with the tag, or an adaptor 
having one end of a shape different from that of 
the tag, the adaptor can be arranged at each end 
of a concatemer consisting of a large number of 
tags. In this way, even a micro amount of 
concatemer can be cloned, and as a result, 
analysis becomes possible with a micro amount of 
sample mRNA . 

The molar ratio of the tags and the adaptor 
is usually tag:adaptor = 1:1 to 1:0.01, 
preferably 1:0.2 to 1:0.05. If the adaptor is 
used in this range, the concatemer composed of 
about 2 to 50 ligated tags may be obtained. 

Insertion of the obtained concatemer into a 
cloning vector for nucleotide sequencing makes 
the sequencing operation easy. 

When MAGE/pUCl9 is used, a small fragment 
containing a tag is obtained by digesting the 
amplification product of the step (d) with a 
restriction enzyme Mbol (Fig. 5, M). The tag 
fragment obtained here is present between known 
DNA sequences, i.e., GATC and AAACG, and 
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therefore it: can definitely recognize which 
portion is the tag. Since Mbol sites are exposed 
at the both ends of this tag sequence, the tags 
can ligated to each other by a ligation reaction 
to form a concatemer (Fig- 5, M). 

<6> Step ( f ) 

If the nucleotide sequence of the concatemer 
obtained as described above is determined, since 
tags derived from many cDNA molecules are 
contained in it, the expression frequencies of 
the genes from which the cDNAs originated can be 
analyzed by investigating types and frequencies 
of the tags occurring in the nucleotide sequence. 
Type of the tag can be determined by searching 
for a database containing information of partial 
sequences of known mRNAs (EST). 

When the concatemer of the tags is amplified, 
two or more fragments having the same sequence 
will be formed. However, if a sequence considered 
to be the same as one previously sequenced 
appears in the sequence analysis after the 
cloning, it can be excluded from the analysis to 
eliminate bad influence of PCR on the genetic 
expression frequency analysis. 

Further, when the same enzyme is used as the 
fourth and fifth restriction enzymes, since 
direction of the tags in the concatemer is not 
fixed, sequences of not only one strand but also 
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the reverse strand are taken into consideration 
in the sequence analysis. 

As explained above, in the method of the 
present invention, a ditag is not produced as in 
the SAGE method, but a tag can be prevented from 
directly connecting with other tags by putting 
each tag between known DNA fragments. As a result, 
the problem that the border between tags becomes 
indefinite is solved. Further, since a sample can 
be amplified by performing PCR several times, the 
analysis can be possible even with a micro amount 
of mRNA. Furthermore, the use of vector primer 
enables cDNA synthesis in the state that the cDNA 
is fused to the vector. Therefore, the analysis 
can be performed without using avidin and biotin 
beads . 

Example 

Thirty five micrograms (35 /jg) of mRNA was 
extracted from 1 g of C57BL/6 mouse livers by 
using Fast Track 2.0 kit produced by Invitrogen. 

After cDNA was synthesized through reverse 
transcription by using 0.97 /jg of the obtained 
mRNA and MAGE/pUC19, genetic expression frequency 
analysis was performed as follows. 

The MAGE/pUC19 vector (Fig. 2, A) was 
digested with restriction enzymes BstXI and PstI, 
and the excised small fragment was removed (Fig. 
2, B). Then, a partially double-stranded DNA 
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having a single-stranded poly(T) sequence 
(poly(T) adaptor shown in Fig. 2, C) was ligated 
to the BstXI-digested end (Fig. 2, D). In this 
vector primer, the first restriction enzyme was 
Pmel , the second restriction enzyme was Bglll , 
and the type IIS restriction enzyme was JBsgl (Fig. 
3, E) . 

Zero point two microgram (0.2 jjq) of the 
vector primer and 0.97 jjg of the mRNA derived 
from mouse liver were annealed, and cDNA 
synthesis was performed. Then a second strand was 
synthesized by using the synthesized first cDNA 
as a template (Fig. 3, F). Genetic expression 
frequency analysis was performed by using the 
obtained cDNA in an amount of 1/40 of the 
obtained amount of cDNA (corresponding to 0.025 
jjg of mRNA) as a material. 

In another experiment, genetic expression 
frequency analysis could be successfully 
performed by using 0.05 jug of micro amount mRNA 
extracted from 4.5 mg of micro amount clinical 
sample as an experimental material to synthesize 
cDNA through reverse transcription and following 
the same procedures as described below. 

Then, the vector primer to which the cDNA 
was ligated obtained as described above was 
digested with Bgrlll and Mbol , which were the 
second and third restriction enzymes, 
respectively (Fig. 3, G). Subsequently, the Mbol 
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end and Bglll end of cDNA-MAGE /pUC 1 9 were ligated 
by a self-ligation reaction to cyclize the vector 
primer (Fig- 3, H). 

The above cyclized vector primer was 
digested with Bsgl and Pmel , and the digested 
ends were blunt-ended by T4 DNA polymerase 
treatment. Then, the vector primer was cyclized 
again (Fig. 4, I to K). PCR was performed by 
using this cyclized vector primer as a template, 
oligonucleotides having nucleotide sequences 
corresponding to the respective flanking regions 
of the both sides of the tag in the vector primer 
(SEQ ID NOS : 2 and 3) as primers, and an enzyme 
AmpliTaq Gold (PE Biosystems) to amplify the tag 
(Fig. 4, L ) . The above PCR was performed by 
repeating a reaction cycle of denaturation (95°C, 
0.3 minute), annealing and an extension reaction 
with the polymerase (72°C, 1-5 minutes) for 65 
cycles. As a result, a DNA fragment of 110 bp 
containing the tag at the center was amplified. 

The above PCR amplification product was 
digested with Mbol to excise the tag, and a 
ligation reaction was performed with addition of 
an adaptor obtained by annealing oligonucleotides 
having the nucleotide sequences shown in SEQ ID 
NOS: 4 and 5 such that the ratio of tagtadaptor 
should be 8:1 to form a concatemer of the tags 
(Fig. 5, M) . This concatemer was amplified by PCR 
using a primer having the nucleotide sequence 
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shown in SEQ ID NO: 6. For PCR, a reaction cycle 
of denaturation (95°C, 0*3 minute), annealing 
(40°C / 3 minutes) and an extension reaction with 
polymerase (72°C, 1 minute) was repeated for 5 
cycles, and then a reaction cycle of denaturation 
(95°C, 0.3 minute), annealing and an extension 
reaction (72°C, 1 minute) was repeated for 60 
cycles. The amplification product was digested 
with a restriction enzyme JVotI, and inserted into 
the NotI site of the cloning vector pKF3 for 
sequencing, and the nucleotide sequence was 
determined . 

Based on the obtained sequence data (for 
11000 tags), the tags were divided into groups of 
tags having the same sequence, and the order of 
the groups according to the occurring frequency 
was determined. The tags ranked first to tenth 
for the frequency from the highest are shown in 
Table 1 . This result shows that the gene showing 
the highest expression frequency in the mouse 
liver tissue was the urinary protein I/II gene. 
Following that gene, the albumin gene, the 
urinary protein III gene and the 

argininosuccinate synthase gene are ranked in the 
order of the expression frequency from higher one 
Thus, even with a micro amount of mRNA sample, 
the method of the present invention shows high 
recovery efficiency of cDNA by the use of a 
vector primer, and enables genetic expression 
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frequency analysis with high precision since 
ambiguity of tag sequences can be eliminated. 



Table 1 



Tag (SEQ ID NO) 


Rank for 
frequency 


Number 
of tag 


Designation of gene 


TGCATTCCATC (7) 


1 


2 196 


Urinary protein I/II 


CCTGGTGGAAA ( 8 ) 


2 


126 1 


Albumin 


TGCTCTCCACC ( 9 ) 


3 


485 


Urinary protein III 


GGGAAGTACGC (10) 


4 


383 


Argininosuccinate 
synthase 


ACCTCGGATGA (11) 


5 


345 


Fibrinogen Aa 


TTCCAGGCCCG ( 12 ) 


6 


333 


Apolipoprotein E 


ACCAGTGTCGC (13) 


7 


310 


Mouse EST 


TGCATGCCCTG (14) 


8 


307 


Ferritin light chain 


CACTACAGCAC (15) 


9 


300 


Mouse EST 


CTGCCAAGTTC (16) 


10 


226 


Retinol-binding 
protein 



Industrial Appl icabil ity 

According to the present invention, 
expression frequencies of genes can be surely 
analyzed with good precision in a simple manner. 

The method for analyzing genetic expression 
of the present invention is useful for researches 
concerning life science, and it is particularly 
useful for development of methods for treatment 
or diagnosis of diseases through analysis of 
difference in genetic expression between specific 
organs or cells of healthy subjects and subjects 
with diseases. For example, by analyzing 
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difference in expression of genes between livers 
of healthy subject and subject with hepatitis by 
the method of the present invention, a gene of 
which expression is specifically increased or 
decreased in hepatitis can be discovered. By 
investigating the role in the liver of such a 
gene, drugs for inhibiting or accelerating the 
function of the gene can be developed for the 
purpose of treatment of hepatitis. Moreover, the 
gene itself, an antisense oligonucleotide 
designed based on the gene structure, a protein 
produced by expression of the gene and so forth 
can also be used for the treatment of hepatitis. 

Further, even for a disease of which 
mechanism for its onset is not elucidated yet, a 
method of treatment may be developed by using the 
method of the present invention. Furthermore, if 
a gene of which expression is changed in a 
disease-specific manner can be found by the 
method of the present invention, it becomes 
possible to develop not only a method of 
treatment, but also a method of diagnosis of a 
disease . 

Moreover, not only use in the medical field, 
the method can also be means for identifying 
useful genes from all of eukaryotic organisms. 
For example, after yeast strains suitable for the 
brewing of beer are bred by mutation, change of 
genetic expression between a parent strain and a 
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mutant strain can be analyzed by the method of 
the present invention, so that a gene of which 
expression is changed by the mutation can be 
identified. By comprehensively manipulating genes 
advantageous for the brewing of beer, which are 
obtained as described above, a better brewers' 
yeast strain can be created. 

Further, by analyzing amino acid-producing 
bacteria such as Escherichia coll and 
Corynebacterlum bacteria, for example, more 
excellent amino acid-producing bacteria can be 
created . 
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WHAT IS CLAIMED IS: 

1. A method for analyzing expression 
frequencies of genes, which comprises the 
following steps: 

(a) a step of forming a vector primer to which 
each cDNA is ligated, by annealing the vector 
primer with each mRNA derived from a cell of 
which expression frequencies of genes is to be 
analyzed, and synthesizing the cDNA, said vector 
primer comprising a linear plasmid vector having 
a single-stranded poiy(T) sequence at one 3' end, 
a recognition sequence for a first restriction 
enzyme in an inner position from the poly(T) 
sequence, a recognition sequence for a second 
restriction enzyme near the other end, and a 
recognition sequence for a type IIS restriction 
enzyme in an inner position from the recognition 
sequence for the second restriction enzyme, 

(b) a step of digesting the vector primer to 
which the cDNA is ligated, with the second 
restriction enzyme and a third restriction enzyme 
that does not digest the vector primer and forms 
a digested end of the same shape as a digested 
end obtained with the second restriction enzyme, 
to excise an upstream region of the cDNA, and 
cyclizing the vector primer, 

(c) a step of digesting the cyclized vector 
primer with the first restriction enzyme and the 
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type IIS restriction enzyme to excise a 
downstream region of the cDNA so that a tag 
consisting of a part of the cDNA is left, and 
cyclizing the vector primer again, 

(d) a step of performing polymerase chain 
reaction (PCR) by using the vector primer as a 
template and oligonucleotides having nucleotide 
sequences corresponding to respective flanking 
regions of the both sides of the tag contained in 
the vector primer as primers to amplify the tag, 

(e) a step of ligating the amplification products 
to form a concatemer of the tags, and 

(f) a step of determining the nucleotide sequence 
of the concatemer and investigating types and 
frequencies of tags occurring in the nucleotide 
sequence . 

2. The method according to Claim 1, wherein 
the ligation reaction in the step (e) is 
performed in the presence of an adaptor having 
one end of the same shape as an end of the tag to 
arrange the adaptor at each end of the concatemer, 
and the concatemer is amplified by performing PCR 
using an oligonucleotide having a sequence 
corresponding to the sequence of the adaptor as a 
primer . 

3. The method according to Claim 1 or 2, 
wherein, after the step (e), the concatemer is 
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cloned in a cloning vector for nucleotide 
sequencing, and then the nucleotide sequence of 
the concatemer is determined, 

4. The method according to any one of Claims 
1-3, wherein the recognition sequence for the 
third restriction enzyme consists of four 
nucleotides . 

5. The method according to any one of Claims 
1-4, wherein the vector primer has a recognition 
sequence for a fourth restriction enzyme of which 
digestion point is in the same position as or an 
inner position from the digestion point of the 
recognition sequence for the second restriction 
enzyme, which is not excised from the vector 
primer by the digestion with the type IIS 
restriction enzyme ; 

the primer for the downstream side of the 
tag among the primers used in the step (d) has a 
recognition sequence for a fifth restriction 
enzyme that forms an end of the same shape as the 
end digested with the fourth restriction enzyme; 
and 

the concatemer is formed after the amplified 
primers are digested with the fourth restriction 
enzyme and the fifth restriction enzyme. 

6. The method according to Claim 5, wherein 
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the vector primer has a nucleotide sequence 
different from the recognition sequence for the 
fifth restriction enzyme by one nucleotide in an 
inner position from the recognition sequence for 
the first restriction enzyme, and the nucleotide 
sequence different by one nucleotide is converted 
to the recognition sequence for the fifth 
restriction enzyme by PCR using the primer for 
the downstream side of the tag. 

7. The method according to Claim 6, wherein 
the third, fourth and fifth restriction enzymes 
are identical to one another. 

8. The method according to any one of Claims 
1-7, wherein the vector primer is formed by 
ligating a linear plasmid obtained by digesting a 
plasmid having a multicloning site at two sites 
in the multicloning site, and a partially double- 
stranded DNA having an end of the same shape as 
one end of the linear plasmid and a single- 
stranded poly(T) sequence. 
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ABSTRACT OF THE DISCLOSURE 

In gene expression frequency analysis, a 
vector primer to which each cDNA is ligated is 
formed by synthesizing the cDNA from each mRNA 
using by a vector primer having a poly(T) 
sequence; each cDNA sequence is converted to a 
tag on the vector primer; a concatemer is formed 
by ligating the obtained tags via a sequence that 
enables recognition of ends of the tags; and the 
nucleotide sequence of the concatemer is analyzed- 
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09/926 02 8 



Z fragment (SEQ ID NO: 1) 




Fig. 1 
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; ; Z-fro^nent 

! • 

EcoR j BsgJlMbo I; Pst I BstX I 

Cgmttcgtgc^tctcaca'c^ 
CTTAAGCACCTCTAGAGTGTGACGTCT^ 
MAGE/pUC19 J 

)Pst I 
BstX I 

^^Removal 

GAATTCGTGCAGATCTCACACTGCA GAGA^CCAACAGC ATGGAAGCTT 

Q /^"^ CTTAAGCACGTCTAGAGTGTG ACGTCTCTAGGTT GTCCTTACCTTCGAA^v 

c ) 

t 

CGGAGTTTAAACGGATTGGAGCCAGC 

I 1 1 I I 1 1 I I 1 1 I I I I 1 1 I I I 1 I I 1 1 1 1 I I 1 I 1 I I I 1 1 I f I GCCTCAAATTTGCCTAACCTCG i 

polyCO adaptor T Ligation 

Y AT 



ATGGAAGCTT 
GTCGTACCTTCGAA 



C 



GAATTCGTGCAGATCTCACACTGCA 
CTTAAGCACGTCTAGAGTGTG 




I 



First 

/ 

Pme I AMbo I 



^^^^ C^AGTTTAMCGGATTGGAGCCAGCATGGAAGCTT 

I II 1 1 I I 1 1 I I I I I I I I I N I I I I I I I I I ! I 1 1 I I I I I! I GCCTCAAATTTGCCTAACCTCGGTCGTACCTTCGAA* 



GAATTCGTGCAGATCTCACACTGCA 
CTTAAGCACGTCTAGAGTGTG 



Vector primer for reverse transcription 
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First 



mRNA 



-AAAAAAAAAAAAAAAAAAAAAAAA 



Second strand synthesis 



/ 



G 

Type IIS restriction enzyme 




Fifth 



-AAAAAAAAAAAAAA^ CGGAGTTT AAACGGATTG6AGCCAGCATGGAAGCTT, 

I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 GCCTCAAATTTGCCTAACCTCGGTCGTACCTTCGAA 

CD First strand synthesis BsgJLja* 11 

;^gaattc6to^^ Second 

*CTTAAGCACGT^TA^AGTGTG 
Mbo r 



-Synthesized cDNA- 



Vector primer for 
reverse transcription 



Fourth 



Pme I AMx) I 



NNNWNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNN^r^NNNNfWNNNNNNHNNNNNNt I I 1 II H I I 1 I 1 I I 1 I 1 1 I GCCTCAAATTTGCCTAACCTCGGTCGTACCTTCGAA ' 

Bql II 



JIAA 

i c: 



CGTGCAGATCTLAC 



Bsg 

^TTCGTGC^CUTCTCACACTGCA 
CTTAAGCACGT^TAG/VGTGTG 



Mbo I 



Synthesized cDNA 




Removal 



7 



I 




Third 
3' terminal portion 



Bgt II 




Removal 



H 



Ligation, cyclization 



3' terminal portion of 
synthesized cDNA 




x 



Mbol-digested end 
Bglll-digested end 



Fig. 3 



OBLON ET AL (703) 413-3000 




DOCKET # SHEET_^OFaS1. 



4/5 



c 



3* terminal portion of synthesized cDNA 
Bsq I Pme I 



GAATT 

'CTTAAGCAC GTCTAG ^M^tJ^M^MNwJiNN^tti 

Digestion point of Bsgl 



ggaGtttaaacCgattggagccagcatggaagctt 

bHW I II U I 1 1 H 1 1 I GCCTCAAATTTGCCTAACCTCGGTCGTACCTTCGAA 



K 



Bsg I 

Pme I treatment 



j^- Removal 



Bsg I 

GAATTc6tGCAgWwW4NNNNWNNN WWN^f#^h#^NAAjAAAAAAAAAAAACGGAGTTT AMCGGATTGGAGCCAGCATGGAAGCTT 
'CrrAAGCACGTCTAGjWWNNNNNWI. KNNTWNNNNNNWhMNNN I 1 I II 1 1 1 1 1 1 1 1 1 IGCCTCAAA TTTGCCTAACCTCGGTCGTACCTTCGAA 



( 



PCR primer 

UP(-48)26mer 



:ag 



(D Blunting of ends 
© Ligation, cyclization 



GMTCGTGCAUTCNNNNNW«NNNAAACGGATTG(^CCW 
J CTTAAGCAC GTCTAG fWNNNNNNhWNTTTGCCTAACCTC GGTCGTACCTTC GAA 



NNNNNWJNhWNTTTG 

tag " 



PCR primer \ 

ZZ-roakeft>o I \ 




Primer sequences 

UPC-48)26mer: ACGCCAGGGTTTTCCCAGTCACGACG 

ZZ-makeMbo I: AT6ATTACGCCAAGCTTCCATGCTGGCTCCGATCCGTTT 



PCR amplification product 

Mbpi tag 



Fifth 



UPC-^S)26mer primer 



GATCNNNNNNNNNNNAAACGGATC 
CTAGNNNNNNNNNNNTTTGCCTAG 



ZZ-make^t>o I primer 



Fig. 4 
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tag 



GATCNNNNNNNNNNNAAACG 

NNNNNNNNNNNTTTGCCTAG 



primer 



/ 



x 



31 IE 



tag tag tag tag tag tag tag tag tag 

PGR primer 




Cloning vector for sequencing 



Fig. 5 
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My residence, post office address and citizenship are as stated below next to my name, 

We (I) believe that we are (I am) die original, first, and joint (sole) Invet ror(s) of the subject matter which is 
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Method for analyzing expression frequencies of genes 



the specification of which 



is attached hereto. 
□ was filed on 
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SEQUENCE LISTING 
<110> Ajimonoto Co., Inc. 

<120> Method for analyzing expression frequencies of genes 

<130> B-583AY0P962 

<150> JP 11-38538 
<151> 1999-02-17 

<160> 16 

<210> 1 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: multi-cloning site 
<400> 1 

gattcgtgca gatctcacac tgcagagatc caacagcatg gaagctt 47 

<210> 2 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

acgccagggt tttcccagtc acgacg 26 

<210> 3 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 3 

atgattacgc caagcttcca tgctggctcc gatccgttt 

<210> 4 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: adaptor 
<400> 4 

gatcgagatc tgcaaccaga gtcg 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: adaptor 
<400> 5 

cgactctggt tgcagatctc 

<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

aacttcgact gcggccgcag atctcgatc 

<210> 7 
<211> 39 
<212> DNA 

<213> Mus musculus 

<400>7 
tgcattccat c 
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<210> 8 
<211> 11 
<212> DNA 

<213> Mus musculus 
<400> 8 

cctggtggaa a 11 

<210> 9 
<211> 11 
<212> DNA 

<213> Mus musculus 
<400> 9 

tgctctccac c 11 

<210> 10 

<211> 11 

<212> DNA 

<213> Mus musculus 

<400> 10 

gggaagtacg c 11 

<210> 11 
<211> 11 
<212> DNA 

<213> Mus musculus 
<400> 11 

acctcggatg a 11 

<210> 12 
<211> 11 
<212> DNA 

<213> Mus musculus 
<400> 12 

ttccaggccc g 11 



<210> 13 
<211> 11 
<212> DNA 



<213> Mus musculus 

<400> 13 
accagtgtcg c 

<210> 14 
<211> 11 
<212> DNA 

<213> Mus musculus 

<400> 14 
tgcatgccct g 

<210> 15 
<211> 11 
<212> DNA 

<213> Mus musculus 

<400> 15 
cactacagca c 

<210> 16 
<211> 11 
<212> DNA 

<213> Mus musculus 



<400> 16 
ctgccaagtt c 
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SEQUENCE LISTING 



<110> MAEKAWA, TAKAMI 
MITSUI, AKIRA 
DATE, MASAYO 
FUKUDA, HI SAO 
T AK AH ARA , YO S H I YUK I 

<120> METHOD FOR ANALYZING EXPRESSION FREQUENCIES OF GENES 

<130> 212833US0PCT 

<140> 09/926, 028 
<141> 2001-08-16 

<150> PCT/JP00/00 902 
<151> 2000-02-17 

<150> JP 11-038538 
<151> 1999-02-17 

<160> 45 

<170> Patentln version 3.1 

<210> 1 
<211> 47 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 1 

gattcgtgca gatctcacac tgcagagatc caacagcatg gaagctt 

<210> 2 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 2 

acgccagggt tttcccagtc acgacg 

<210> 3 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 3 

atgattacgc caagcttcca tgctggctcc gatccgttt 



<210> 4 



<211> 
<212> 
<213> 



24 



DNA 



Artificial Sequence 



<220> 

<223> Synthetic DNA 

<400> 4 

gatcgagatc tgcaaccaga gtcg 

<210> 5 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 5 

cgactctggt tgcagatctc 

<210> 6 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 6 

aacttcgact gcggccgcag atctcgatc 



<210> 7 

<211> 11 

<212> DNA 

<213> Mus musculus 



<400> 7 

tgcattccat c 11 

<210> 8 

<211> 11 

<212> DNA 

<213> Mus musculus 



<400> 8 

cctggtggaa a 11 

<210> 9 

<211> 11 

<212> DNA 

<213> Mus musculus 



<400> 9 

tgctctccac c 11 

<210> 10 

<211> 11 

<212> DNA 

<213> Mus musculus 



<400> 10 
gggaagtacg c 

<210> 11 

<211> 11 

<212> DNA 

<213> Mus musculus 

<400> 11 
acctcggatg a 

<210> 12 

<211> 11 

<212> DNA 

<213> Mus musculus 

<400> 12 
ttccaggccc g 

<210> 13 

<211> 11 

<212> DNA 

<213> Mus musculus 

<400> 13 
accagtgtcg c 

<210> 14 
<211> 11 
<212> DNA 



<213> Mus musculus 

<400> 14 
tgcatgccct g 

<210> 15 

<211> 11 

<212> DNA 

<213> Mus musculus 

<400> 15 
cactacagca c 

<210> 16 

<211> 11 

<212> DNA 

<213> Mus musculus 

<400> 16 
ctgccaagtt c 

<210> 17 

<211> 48 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 



<400> 17 

aagcttccat gctgttggat ctctgcagtg tgagatctgc acgaattc 



<210> 18 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 18 

gaattcgtgc agatctcaca ctgca 

<210> 19 

<211> 13 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 19 
gagatccaac age 

<210> 20 

<211> 10 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 20 



atggaagctt 

<210> 21 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 21 

gtgtgagatc tgcacgaatt c 

<210> 22 

<211> 13 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 22 
ttggatctct gca 

<210> 23 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 



<400> 23 
aagcttccat gctg 

<210> 24 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 

<400> 24 

cggagtttaa acggattgga gccagc 

<210> 25 

<211> 62 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 

<400> 25 

gctccaatcc gtttaaactc cgtttttttt tttttttttt tttttttttt tttttttttt 
tt 

<210> 26 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 

<400> 26 

cggagtttaa acggattgga gccagcatgg aagctt 

<210> 27 

<211> 76 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 
<400> 27 

aagcttccat gctggctcca atccgtttaa 
tttttttttt tttttt 

<210> 28 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



actccgtttt tttttttttt tttttttttt 



<220> 

<223> Synthetic DNA 
<220> 

<221> polyA_signal 
<222> (1) . . (24) 
<223> 



-10- 



<400> 28 

aaaaaaaaaa aaaaaaaaaa aaaa 



<210> 29 

<211> 66 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 
<400> 29 

aagcttccat gctggctcca atccgtttaa actccgtttt tttttttttt tttttttttt 
tttttt 

<210> 30 

<211> 104 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 
<220> 

<22 1> misc_f eature 

<222> (1) . . (48) 

<223> n = a, c, g, or t 

<400> 30 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnaa aaaaaaaaaa 



aaaaaaaacg gagtttaaac ggattggagc cagcatggaa gctt 



<210> 31 

<211> 104 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 
<220> 

<2 2 1> misc_f eature 

<222> (57) . . (104) 

<223> n = a, c, g, or t 



<400> 31 

aagcttccat gctggctcca atccgtttaa actccgtttt tttttttttt ttttttnnnn 60 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnn 104 



<210> 32 

<211> 96 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic DNA 
<220> 

<2 21> mi sc_f eature 

<222> (16) . . (45) 

<223> n = a, c, g, or t 



-12- 



<400> 32 

gaattcgtgc agatcnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnaaaaa aaaaaaaaaa 
cggagtttaa acggattgga gccagcatgg aagctt 

<210> 33 

<211> 96 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<22 1> mi sc_f eature 

<222> (52) . . (81) 

<223> n = a, c, g, or t 

<400> 33 

aagcttccat gctggctcca atccgtttaa actccgtttt tttttttttt tnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn ngatctgcac gaattc 

<210> 34 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 



-13- 



<220> 

<22 1> misc_f eature 

<222> (16) . . (28) 

<223> n = a, c, g, or t 

<400> 34 

gaattcgtgc agatcnnnnn nnnnnnnn 

<210> 35 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<2 2 1> mi sc_f eature 

<222> (1 ) . . (17) 

<223> n = a, c, g, or t 

<400> 35 

nnnnnnnnnn nnnnnnnaaa aaaaaaaaaa aacggagttt 

<210> 36 

<211> 28 

<212> DNA 

<213> Artificial Sequence 



-14- 



<220> 

<223> Synthetic DNA 

<400> 36 

aaacggattg gagccagcat ggaagctt 

<210> 37 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<2 21> misc_f eature 

<222> (1 ) . . (11) 

<223> n = a, c, g, or t 

<400> 37 

nnnnnnnnnn ngatctgcac gaattc 

<210> 38 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

-15- 



<221> 



misc feature 



<222> 



(24) . . (42) 



<223> 



n = a, 



c, g, 



or t 



<400> 38 

aaactccgtt tttttttttt tttnnnnnnn nnnnnnnnnn nn 



42 



<210> 39 

<211> 28 . 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 

<400> 39 

aagcttccat gctggctcca atccgttt 28 

<210> 40 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<2 2 1> misc_f eature 

<222> (16) . . (26) 

<223> n = a, c, g, or t 



-16- 



<400> 40 

gaattcgtgc agatcnnnnn nnnnnnaaac ggattggagc cagcatggaa gctt 

<210> 41 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<2 21> misc_f eature 

<222> (29) . . (39) 

<223> n = a, c, g, or t 

<400> 41 

aagcttccat gctggctcca atccgtttnn nnnnnnnnng atctgcacga attc 

<210> 42 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<2 2 1> mi sc__f eature 

<222> (5) . . (15) 



-17- 



<223> n = a, c, g, ort 



<400> 42 

gatcnnnnnn nnnnnaaacg gate 

<210> 43 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 

<221> mi sc_f eature 

<222> (10) . . (20) 

<223> n = a, c, g, or t 

<400> 43 

gatccgtttn nnnnnnnnnn gate 

<210> 44 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic DNA 
<220> 



<2 2 1> misc_f eature 

<222> (5) . . (15) 

<223> n = a, c, g, or t 

<400> 44 

gatcnnnnnn nnnnnaaacg 

<210> 45 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Synthetic DNA 
<220> 

<22 1> mi sc_f eature 

<222> (10) . . (20) 

<223> n = a, c, g, or t 



<400> 45 

gatccgtttn nnnnnnnnnn 



